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Abstract 



In a recent paper, Michalowicz et al. provide an extension of Isserlis theorem to the case of a 
Bernoulli location mixture of a Gaussian vector. We extend here this result to the case of any 
location mixture of Gaussian vector; we also provide an example of the Isserlis theorem for a "scale 
location" mixture of Gaussian, namely the d— dimensional generalized hyperbolic distribution. 
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Oh 

i-^ ' 1. Introduction 

03 

Isserlis theorem, as discovered by Isserlis [1|| in 1918, allows to express the expectation of a 
monomial in an arbitrary number of components of a zero mean Gaussian vector X £ R d in terms 
of the entries of its covariance matrix only. Before providing in Thin Q] the slightly generalized 
version of Isserlis theorem due to Withers [3|, we introduce the following notations: for any set 
A = {«i, . . . , aw} of integers such that 1 < oti < d and any vector X £ R d , we use the multi-index 
^— s ' notation and denote 

CN ' a t eA 

t**** ■ with the convention that for the empty set 

o 

A pairing in a set A is a partition of A into disjoint pairs. We denote by II (A) the set of all pairings 
a in A: note that IT (A) is empty if A has an odd number of elements. For a given a £ II (^4), we 
denote by A/a the set {i; a — (i,a («))} ; finally, ^ YW 1 (XiXj) denotes the sum 

o-en(A) iGA/cr 

In other words, for a given pairing a in the set A, we compute the product of all possible moments 
E(XiXj) where i and j are paired by a; then, Y^Tl^ 1 (^iXj) denotes the sum of these products 
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over all possible pairings in A. As an example 

J2 n E ( X i X j) = E ( X 'i) E ( X 2X 4 ) + 2E (X X X 2 ) E (X1X4) ■ 

{1,1,2,4} 

A general form of Isserlis theorem, due to Withers, is as follows. 

Theorem 1. If A = \ot\, . . . ,Q!2jv} is a set of integers such that 1 < oti < d, Vi G [l,2iV] and 
X G R d is a Gaussian vector with zero mean then 

EX A = Y^U E (X l X J ) (1) 

A 

Moreover, if A — {a±, . . . ,a 2 N+i} then, under the same assumptions, 

EX A = 0. 
For example, choosing on = i, 1 < i < 4 yields the well-known identity 

E (X 1 X 2 X 3 X 4 ) = E (A!A 2 ) E (X 3 X 4 ) + E (^X-,) E (A 2 A 4 ) + E (AiA 4 ) E (X 2 X 3 ) . 
However, indices ct^ need not be distinct: for example, choosing a, = l, 1 < z < 4 yields 

Exf = mxf. 

Several extensions of this result have been provided recently: in [3j], Withers extends Isserlis 
theorem to the case of noncentral Gaussian vectors and relates the result with multivariate Hermitc 
polynomials; in [4|, a general formula for Gaussian scale mixtures, and more generally for ellip- 
tically distributed vectors is derived; it is applied to the computation of moments of the uniform 
distribution on the sphere. In [5], Isserlis theorem is extended to the computation of the moments 
of linear combinations of independent Student-t vectors. In [6|, Isserlis theorem is extended to 
Gaussian matrix mixtures, i.e. random vectors of the form 

X = AN 

where N is a standard Gaussian vector in R d and A is a (d x d) random matrix. Let us also 
mention the reference [8] where the author tackles the computational complexity of formula ([1]), 
using Magnus lemma to replace a product of n variables by sums of polynomials of degree n in 
these variables. 

Recently, Michalowicz et al. [2| addressed the case of Gaussian location mixtures: they provided 
an extension of Isserlis theorem to the case of a random vector X £~R. d with probability density 

fx (x) = -4>r {x + jj) + -<j>R (x - jj) (2) 

where 

^ R ( X ) = 10 n.i 6XP ( -o xtR ~ 1 '" 

\2irR\2 \ £ 

is the d— variate Gaussian density with zero mean and covariance matrix R. 

In the following, we give a new and simple proof of the result by Michalowicz et al., adopting a 
formalism that allows us to extend their results to the general case of an arbitrary Gaussian location 
mixture. We also provide an extension of these results to the case of a scale-location mixture of 
Gaussian. 



2. Extensions of the result by Michalowicz et al. 

A key observation is that the random vector X with density ([2]) reads 

X = e/i + C (3) 

where e is a Bernoulli random variable (Pr{e = —1} = Pr{e = 1} = i), ( £ K d is a zero mean 
Gaussian vector and equality is in the sense of distributions. This stochastic representation allows 
to prove easily a generalized version of the main result of [2j , namely 

Theorem 2. If X £ W 1 is distributed according to @) and A = {ai, . . . , Qf2Af} with 1 < on < d 
then 

N 

e ^=e e ^eii 1 ^)- 

fc=0 SCA A\S 

\S\=2k 

If A = {«i, . . . , tt2Ar+i} then EX a = 0. 

The simplified proof we propose is as follows: by <j3j) , 

EX A = E (e/i + iV) A 
and since the product (a + 6) A can be expanded as 

27V 

( a + b ) A = E E fl S & A\S, 
fe=0 SCA 

|S|=fc 



we deduce that 



2JV 

EX -4 = E E /^ ( £,£ ) E (Cas) 

fe=0 SCA 
|S|=fc 



By Isserlis theorem, the expectation of the product of an odd number of centered Gaussian random 
variables Q is equal to zero so that this expression simplifies to 



JY 



£ £ M S E(e 2 *)E(a\ S ). 



fe=0 ScA 

S|=2fe 



Since e is Bernoulli distributed, all its even moments are equal to 1; moreover, since £ has zero 
mean, by Isserlis theorem, E \Ca\s) = Y2 T\E (CiCj) an d we obtain 

A\S 



N 



ex a = J2 E msEII e ^o) 

fc=0 ScA A\S 

S|=2fe 

which is the desired result. The case where A has an odd number of elements is equally simple. 

We note that Theorem [2] can be also easily deduced using generating functions as done in [3j, 
Theorem 1.1] who proves a version of Wick's theorem for a Gaussian vector with mean fi =/= : 
choosing a Bernoulli randomized version of this mean as in (j3|) yields the result. 



3. The general case of Gaussian location mixture 

With the useful representation ([3]) , we can generalize the preceding result to any kind of location 
mixture of Gaussian: namely, we consider a random vector X £ M. d that reads 

X = M + C (4) 

where £ is a zero-mean Gaussian vector in M. d , independent of the random vector fi £ M. d with prob- 
ability distribution _F M ; note that the vector \i may be discrete - taking values /ii with probabilities 
Pi - or not, but we don't need to assume the existence of a density / M . In the discrete case, the 
density of X reads 

+oo 

fx (x) = ^2pi(/)R (x - fu) ; 

j=0 

and in the most general case, 



fx (x) = <Pr(x-h) dF^ (/i) . 



We now state our main theorem. 



Theorem 3. Assume that X £ M. d follows model Op and that all the first-order moments m^ = E/ij. 
of /! exist. Then if A — {a\ 1 . . . , «2Af+e} ■, with e £ {0, 1} , 



N 

E ^ = E E E(^)En E teO) (5) 

fe=0 SCA A\S 

\S\=2k+e 

We remark that if all elements a, of A are different and if the vector fj, has independent components, 
this expression can be further simplified to 

EJ ^ = E E oMsEIi^o), (6) 

fc=0 SCA A\S 

\S\=2k+e 

noting the difference between E(/Lts) = EJ| Q . eS /i Qi in © and (E/i) s = Y\ a . eS E[i ai in ©. 
The proof is as follows. 



Proof 1. From Q), we deduce 



2JV+e 



EX *= E E E(// s )E(a\s)- 



k=0 SCA 

\S\=k 

Since the cardinality of A\S is 2N + e — k, E ((a\s) = unless \S\ = k has the same parity as 
e, in which case it is equal to X}I1 E (CiCj) > hence formula [E\). Formula (0|) is easily deduced from 

A\S 

formula {5p assuming that the components of pi are independent and that all elements of A are 
distinct. 

We now provide a further generalization of Isserlis theorem by considering a Gaussian vector with 
both random scale and location parameters. 



4. A Normal variance-mean mixture application 

The generalized d— dimensional hyperbolic distribution was introduced by Barndorff-Nielsen in 

1978 [7|. It is the distribution of a random vector that reads 

X = /i + cr 2 A/3 + crA^C (7) 

where /j, and j3 are two deterministic vectors in R d , A is a deterministic (d x d) matrix with | A| = 1, 
£ is a standard Gaussian vector in R d and a 2 is a scalar random variable that follows the Generalized 
Inverse Gaussian GIG (ip, \i ^) distribution 

U, XlX (x) = 2 ^y x^ 1 exp (-± (xx- 1 + 1/jxf) , x > (8) 

with parameters tjj > 0, x > and A € R. We note that in (J7J|, the GIG random variable a 2 appears 
both as a scale and location parameter of the Gaussian vector, hence the "normal variance-mean 
mixture" name. From the stochastic representation ([7]). we derive a version of the Isserlis theorem 
as follows. 

Theorem 4. If X £ K d is a generalized hyperbolic vector as in $7$) and A — {a\ : . . . ,ct2N+e} with 
e e {0, 1} then 

EX A = ^ E VTls\Tm N+ i- p+t '^2 W&ZiZj 

n < / < N TCSCA A\S 

U V I V JV |T|=p,|S|=2J+e 

< p < 2^ + e 
where 7 = A/3, where Z is a centered Gaussian vector with covariance matrix A and 

.21 _ (i>\~^ K x+iWWx) 



mi = Ecr — , 1 

,x/ ^a(vVx) 

Proof 2. Assuming first e = 0, we have 

N N 



with 



1=0 SCA 

\S\=2l 






EZ A \ S 


Z=0 SCA 

S|=2Z 

=En Ez ^ 

A\S 
2; 






Ea 2N - 


" 2/ (m 


+ 


- 2 7)s = 


: Y^ E ^t75\tE (a 

p=0 TcS 
\T\=p 


2\N+l- 


-P 



and 



The moment of order I of the GIG random variable a 2 can be easily computed from |Sj) 

(±y * KxMV^xl 

hence the result. The case e = 1 follows the same steps. 
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